Evaluation of Automatically Identified Index Terms for Browsing Electronic Documents
Abstract
We present an evaluation of domain-independent natural language tools for use in the identification of significant concepts in documents. Using qualitative evaluation, we compare three shallow processing methods for extracting...
Similar resources
Improving browsing in digital libraries with keyphrase indexes
Browsing accounts for much of people’s interaction with digital libraries, but it is poorly supported by standard search engines. Conventional systems often operate at the wrong level, indexing words when people think in terms of topics, and returning documents when people want a broader view. As a result, users cannot easily determine what is in a collection, how well a particular topic is cov...
Searching Documents with Semantically Related Keyphrases
In this paper, we present a tool, called SemKPSearch, for searching documents by a query keyphrase and by keyphrases that are semantically related to that query keyphrase. By relating keyphrases semantically, we aim to provide users with an extended search and browsing capability over a document collection and to increase the number of related results returned for a keyphrase query. Keyphrases provid...
Associating Documents to Concept Maps in Context
To be useful, automatic document classification systems must accurately place documents in categories that are meaningful to users. Because concept mapping externalizes humans’ conceptualizations of a domain, concept maps provide meaningful categories for organizing documents. Since electronic concept-mapping tools provide mechanisms for using concept maps for effective document access, using c...
Searching and Browsing Collections of Structural Information
This paper proposes a new approach to querying collections of structured textual information such as SGML/XML documents. Knowledge about the structure of documents is an additional resource that should be exploited during retrieval since the semantics of the different textual objects can be used to specify an information need much more precisely. However, the traditional probabilistic retrieval...
A Framework for Structuring Multimedia Archives and for Browsing Efficiently through Multimodal Links
This thesis proposes a method for indexing and browsing archives of multimedia documents, and in particular meeting recordings, using printable documents and links. Existing systems for indexing and browsing multimedia data have four main limitations. First, the indexing requires high-level abstractions extracted from multimedia documents, which is still an unsolved problem for rich media such as im...